Update Method of Cost Function to Learn Robust Policy Parameters

نویسندگان
چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust Discretionary Monetary Policy under Cost-Push Shock Uncertainty of Iran’s Economy

T here is always uncertainty about the soundness of an economic model’s structure and parameters. Therefore, central banks normally face with uncertainty about the key economic explanatory relationships. So, policymaker should take into account the uncertainty in formulating monetary policies. The present study is aimed to examine robust optimal monetary policy under uncertainty, by ...

متن کامل

Robust method to retrieve the constitutive effective parameters of metamaterials.

We propose an improved method to retrieve the effective constitutive parameters (permittivity and permeability) of a slab of metamaterial from the measurement of S parameters. Improvements over existing methods include the determination of the first boundary and the thickness of the effective slab, the selection of the correct sign of effective impedance, and a mathematical method to choose the...

متن کامل

‏‎interpersonal function of language in subtitling

‏‎translation as a comunicative process is always said to be associated with various aspects of meaning loss or gain. subtitling as a mode of translating, due to special discoursal and textual conditions imposed upon it, is believed to be an obvious case of this loss or gain. presenting the spoken sound track of a film in writing and synchronizing the perception of this text by the viewers with...

15 صفحه اول

Surprise-modulated belief update: how to learn within changing environments?

Abstract I We propose a new framework for surprise-driven learning that can be used for modeling how humans and animals learn in changing environments. It approximates optimal Bayesian learner, but with significantly reduced computational complexity. I This framework consists of two components: (i) a confidence-adjusted surprise measure to capture environmental statistics as well as subjective ...

متن کامل

Learn++ for Robust Object Tracking

In this paper, a Learn++ (LPP) tracker is proposed to efficiently select specific classifiers for robust and long-term object tracking. In contrast to previous online methods, LPP tracker dynamically maintains a set of basic classifiers which are trained sequentially without accessing original data but preserving the previously acquired knowledge. The different subsets of basic classifiers can ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Transactions of the Institute of Systems, Control and Information Engineers

سال: 2020

ISSN: 1342-5668,2185-811X

DOI: 10.5687/iscie.33.191